# Robust speech processing

Wav2vec2 Large Robust 6 Ft Age Gender Finetuned Gtzan
An audio classification model based on the wav2vec2 architecture, fine-tuned on the privateSLI dataset for age and gender recognition tasks
Audio Classification Transformers
W
languageresearch
15
0
Wav2vec2 Xls R 300m Indonesian
Apache-2.0
An automatic speech recognition model fine-tuned on Indonesian speech data based on Facebook's XLS-R-300M model
Speech Recognition Transformers Other
W
Wikidepia
4,486
1
Xls R 300 Sv Cv7
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Swedish Common Voice 7.0 dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
X
patrickvonplaten
19
0
Wav2vec2 Xls R 1b Korean
Apache-2.0
This model is a Korean automatic speech recognition model fine-tuned on the KRESNIK/ZEROTH_KOREAN - CLEAN dataset based on facebook/wav2vec2-xls-r-1b
Speech Recognition Transformers Korean
W
anantoj
20
2
Wav2vec2 Large Xlsr 53 Demo Colab
Apache-2.0
This model is a speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-large-xlsr-53, primarily used for robust speech event recognition.
Speech Recognition Transformers
W
emre
16
0
Wav2vec2 Large Xls R 300m Romansh Sursilvan
Apache-2.0
Automatic speech recognition model fine-tuned on the Romansh Sursilvan dialect dataset based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers
W
infinitejoy
15
0
Wav2vec2 Large Xls R 300m Hindi
Apache-2.0
This is a Hindi speech recognition model fine-tuned on Hindi datasets based on the facebook/wav2vec2-xls-r-300m model, supporting Hindi speech-to-text tasks.
Speech Recognition Transformers Other
W
ravirajoshi
26
0
Wav2vec2 Large Xls R 300m Latvian
Apache-2.0
This is an automatic speech recognition model fine-tuned on Latvian datasets based on facebook/wav2vec2-xls-r-300m, achieving a WER of 16.98% on the Common Voice 7 test set.
Speech Recognition Transformers Other
W
infinitejoy
222
1
Wav2vec2 Large Xlsr Coraa Portuguese Cv7
Apache-2.0
Portuguese speech recognition model fine-tuned on the Common Voice dataset based on Edresson/wav2vec2-large-xlsr-coraa-portuguese
Speech Recognition Transformers Other
W
lgris
24
0
Wav2vec2 Xls R 300m Turkish Tr Small
Apache-2.0
This is a Turkish speech recognition model fine-tuned on the Common Voice dataset based on the facebook/wav2vec2-xls-r-300m model
Speech Recognition Transformers
W
emre
19
0
Wav2vec2 Indonesian Javanese Sundanese
Apache-2.0
This is a multilingual speech recognition model supporting Indonesian, Javanese, and Sundanese, fine-tuned from facebook/wav2vec2-large-xlsr-53.
Speech Recognition Transformers Other
W
indonesian-nlp
298
6
Wav2vec2 Xls R Pt Cv7 From Bp400h
Apache-2.0
This is a Portuguese automatic speech recognition (ASR) model based on the wav2vec2 XLS-R architecture, fine-tuned on the Common Voice 7 dataset, achieving a word error rate (WER) of 12.13% on the test set.
Speech Recognition Transformers Other
W
lgris
94
0
Wav2vec2 Large Xls R 1b Indonesian
Apache-2.0
An automatic speech recognition model fine-tuned on the Common Voice Indonesian dataset based on facebook/wav2vec2-xls-r-1b
Speech Recognition Transformers Other
W
kingabzpro
14
1
Xls R 2B Te
Apache-2.0
This is a Telugu automatic speech recognition (ASR) model fine-tuned based on the facebook/wav2vec2-xls-r-2b model, trained on the OpenSLR SLR66 dataset
Speech Recognition Transformers Other
X
chmanoj
20
0
Xls R 300m Dv
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Common Voice 8 Dhivehi dataset based on the facebook/wav2vec2-xls-r-300m model
Speech Recognition Transformers Other
X
shahukareem
14
0
Wav2vec2 Large Xls R 300m Sl With LM V1
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Slovenian language (Common Voice 8.0) dataset based on the facebook/wav2vec2-xls-r-300m model, with improved recognition performance through language model (LM) integration.
Speech Recognition Transformers Other
W
DrishtiSharma
25
0
Wav2vec2 Large Xls R 300m Hi Cv8
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Hindi Common Voice 8 dataset based on the facebook/wav2vec2-xls-r-300m model.
Speech Recognition Transformers Other
W
DrishtiSharma
25
0
Wav2vec2 Large Xls R 300m Cv8 Nl
Apache-2.0
An automatic speech recognition model fine-tuned on the Common Voice 8 Dutch dataset based on facebook/wav2vec2-xls-r-300m, including a 6-gram KenLM language model
Speech Recognition Transformers Other
W
RuudVelo
22
0
Wav2vec2 Large Xlsr 53 Demo Colab
Apache-2.0
This is an automatic speech recognition model based on the wav2vec2 architecture, specifically optimized for the Tamil language and supporting Nepali speech recognition tasks.
Speech Recognition Transformers Other
W
Mahalakshmi
17
0
Xls R Nl V1 Cv8 Lm
This is an automatic speech recognition model based on the XLS-R architecture, specifically optimized for Dutch and Flemish, incorporating a 5-gram language model to improve recognition accuracy.
Speech Recognition Transformers Other
X
FremyCompany
14
3
Galician Xlsr
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the Galician dataset based on facebook/wav2vec2-xls-r-300m, achieving a WER of 11.31% on the Common Voice 8.0 test set.
Speech Recognition Transformers Other
G
Akashpb13
110
1
Wav2vec2 Large Xls R 300m Sat Final
Apache-2.0
This is an automatic speech recognition model fine-tuned on the MOZILLA-FOUNDATION/COMMON_VOICE_8_0 - SAT dataset based on facebook/wav2vec2-xls-r-300m, supporting Santali (Ol Chiki) language.
Speech Recognition Transformers Other
W
DrishtiSharma
28
0
Wav2vec2 Large Xls R 300m Br D2
Apache-2.0
A speech recognition model fine-tuned on Breton (Common Voice 8.0) based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers Other
W
DrishtiSharma
21
0
Wav2vec2 Large Xls R 1b Cv8 Mt
Apache-2.0
An automatic speech recognition model fine-tuned on the Common Voice 8 Maltese dataset based on facebook/wav2vec2-xls-r-1b
Speech Recognition Transformers Other
W
RuudVelo
17
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase